Prevalence of nonsensical algorithmically generated papers in the scientific literature
Abstract
In 2014 leading publishers withdrew more than 120 nonsensical publications automatically generated with the SCIgen program. Casual observations suggested that similar problematic papers are still published and sold, without follow-up retractions. No systematic screening has been performed, and the prevalence of such nonsensical publications in the scientific literature is unknown. Our contribution is 2-fold. First, we designed a detector that combs the scientific literature for grammar-based computer-generated papers. Applied to SCIgen, it has 83.6% precision. Second, we performed a scientometric study of the 243 detected SCIgen-papers from 19 publishers. We estimate the prevalence of SCIgen-papers to be 75 per million papers in Information and Computing Sciences. Only 19% of the 243 problematic papers were dealt with: formal retraction (12) or silent removal (34). Publishers still serve and sometimes sell the remaining 197 papers without any caveat. We found evidence of citation manipulation via edited SCIgen bibliographies. This work reveals metric gaming up to the point of absurdity: fraudsters publish nonsensical algorithmically generated papers featuring genuine references. It stresses the need to screen papers for nonsense before peer-review and to chase citation manipulation in published papers. Overall, this is yet another illustration of the harmful effects of the pressure to publish or perish.
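The counts in the abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the figures quoted above (243 detected papers, 12 formal retractions, 34 silent removals); the variable names are illustrative and not taken from the paper.

```python
# Figures quoted in the abstract; variable names are illustrative only.
detected = 243          # SCIgen papers flagged across 19 publishers
retracted = 12          # formal retractions
silently_removed = 34   # removed without a retraction notice

handled = retracted + silently_removed   # papers that were dealt with
remaining = detected - handled           # papers still served or sold
handled_share = handled / detected       # fraction that was dealt with

print(f"handled: {handled} ({handled_share:.1%})")
print(f"still served: {remaining}")
```

The handled share rounds to the 19% stated in the abstract, and the remainder matches the 197 papers reported as still available.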
Similar resources
Detection of computer generated papers in scientific literature
Meaningless computer generated scientific texts can be used in several ways. For example, they have allowed Ike Antkare to become one of the most highly cited scientists of the modern world. Such fake publications are also appearing in real scientific conferences and, as a result, in the bibliographic services (Scopus, ISI-Web of Knowledge, Google Scholar,...). Recently, more than 120 papers ha...
From linguistics to literature: a linguistic approach to the study of linguistic deviations in the Turkish Divan of Shahriar
Chapter I provides an overview of structural linguistics and touches upon the Saussurean dichotomies with the final goal of exploring their relevance to the stylistic studies of literature. To provide evidence for the significance of the study, Chapter II deals with the controversial issue of linguistics and literature, and presents opposing views which, at the same time, have been central to t...
On the teacher-generated vs. learner-generated noticing-the-gap activities in language classes
The purpose of this study is twofold: on the one hand, it is intended to see what kind of noticing-the-gap activity (teacher-generated vs. learner-generated) is more efficient in teaching L2 grammar in classroom language learning. On the other hand, it is an attempt to determine which approach to the noticing-the-gap activity is more effective in the long-term retention of grammar...
Non-photorealistic rendering of algorithmically generated trees
Proactive Detection of Algorithmically Generated Malicious Domains
Using an intrinsic feature of malicious domain name queries prior to their registration (perhaps due to clock drift), we devise a difference-based lightweight feature for malicious domain name detection. Using NXDomain query and response of a popular malware, we establish the effectiveness of our detector with 99% accuracy, and as early as more than 48 hours before they are registered. Our tech...
Journal
Journal title: Journal of the Association for Information Science and Technology
Year: 2021
ISSN: 1532-2882, 1532-2890
DOI: https://doi.org/10.1002/asi.24495